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ABSTRACT 

The dynamical state of galaxy groups at intermediate redshifts can provide information about the 
growth of structure in the universe. We examine three goodness-of-fit tests, the Anderson-Darling 
(A-D), Kolmogorov and x 2 tests, in order to determine which statistical tool is best able to distinguish 
between groups that are relaxed and those that are dynamically complex. We perform Monte Carlo 
simulations of these three tests and show that the % 2 test is profoundly unreliable for groups with 
fewer than 30 members. Power studies of the Kolmogorov and A-D tests are conducted to test their 
robustness for various sample sizes. We then apply these tests to a sample of the second Canadian 
Network for Observational Cosmology Redshift Survey (CNOC2) galaxy groups and find that the 
A-D test is far more reliable and powerful at detecting real departures from an underlying Gaussian 
distribution than the more commonly used x 2 and Kolmogorov tests. We use this statistic to classify 
a sample of the CNOC2 groups and find that 34 of 106 groups are inconsistent with an underlying 
Gaussian velocity distribution, and thus do not appear relaxed. In addition, we compute velocity 
dispersion profiles (VDPs) for all groups with more than 20 members and compare the overall features 
of the Gaussian and non-Gaussian groups, finding that the VDPs of the non-Gaussian groups are 
distinct from those classified as Gaussian. 
Subject headings: galaxies: interactions-galaxies: statistics 



1. INTRODUCTION 

The group environment represents an intermediate 
size and density scale between individual galaxies and 
rich galaxy clusters. With roug hly half of the present- 
day galaxy popu lation in groups (jGeller fc Huchralll983l : 
lEke et all l2005f ) , this environment plays an impor- 
tant role in the evolution of galaxies. The rela- 
tively low velocity dispersions of groups provides an 
ideal environment for galaxy i nteractions (iBarnes 1985: 
IZabludoff fc Mulchaevl Il998t iBrough et all 120061 ) and 
studying the dynamics of galaxy groups is one way to 
probe how dominant these interactions are in the group 
environment. 

The dynamics of rich clusters are often studied us- 
ing extended X-ray emission, which provides informa- 
tion about the potential well of the cluster. Since not 
all groups are sufficiently massive and evolved that their 
X-ray emission can be detected (|Zabludoff fc Mulchaevl 
Il998t iMulchaevI l2000t ). we must use another tracer of 
galactic dynamics. Another method of characterizing the 
dynamical state of gravitating systems such as galaxy 
groups, clusters, cores in molecular clouds, or star clus- 
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ters involves studying their velocity distributions. The 
standard assumption is that the underlying distribu- 
tion is Gaussian in nature, but this is strictly true 
only for systems in dynamical equilibrium. Groups 
with non-Gaussian velocity dist ributions could mark sys- 
tems in the process of a merger ([Menci fc Fusco-Femianol 
1996) or those that are in the early stages of evolu- 
tion. However, the interpretation of galaxy group dy- 
namics can be furthe r complicated by projection effects 
(jRamella et al.l Il989fl. the possible inc lusion of inter- 
loping galaxies (|Robotham et al.l l2008h and the direc- 
tion of group elongation re lative to the line of sight 
(jTovmassian fc Plionisi r2009). To properly study group 
(or cluster) dynamics we need a reliable method of dis- 
tinguishing between relaxed systems with Gaussian ve- 
locity distributions and more complex systems with non- 
Gaussian dynamics. 

Previous analyses of cluster velocity distributions have 
result ed in conflicting views on the dynamics of these sys- 
tems. lYahil fc Vidall (|1977f ) used the a-test, it-test and 
Shapiro-Wilks W-test for non-normality to show that 
the observed radial velocity distributions of clusters of 
galaxies, with as few as 10 and as many as 122 mem- 
bers, are always consistent with an underlying Gaussian 
distribution. More rece ntly, evidence of substructure has 
been found in clusters (|Dressler fc Sheet man! 1 19881 : iBirdl 
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Il994t lBurnslll998f ). indicating that cluster dynamics may 
b e more comp l icated than initially assumed. 

iBeers et al.l (|1990f ) emphasize the difficulty in deter- 
mining that a given velocity distribution differs signif- 
icantly from Gaussian, stating tha t the goodness-of-fit 
tests used by lYahil fc Vidal (| 19771 ) detect different de- 
partures (e.g., skewed or shifted distributions) from a 
true Gaussian distribution. Thus, a system may be clas- 
sified as either Gaussian or non-Gaussian depending on 
the statistic used. These difficulties are more severe when 
studying smaller systems, as in the case with galaxy 
groups. Since group membership can range from 3 to 
50 or more galaxies, we need a statistical test that is 
reliable even for extremely small sample sizes. We also 
require a test that is robust, or unaffected by small de- 
partures from normality, to ensure that the rejections are 
a result of real deviations from a Gaussian distribution 
and not sensitivities inherent to the test. The goodness- 
of-fit tests used to analyze rich clusters are generally not 
applicable to groups, where the challenges of small num- 
ber statistics become relevant. 

In this paper we test three goodness-of-fit techniques in 
order to determine which one in particular is best able to 
distinguish between Gaussian and non-Gaussian velocity 
distributions, especially for small samples. In Section 2, 
we discuss the statistical tests we use to determine depar- 
tures from Gaussianity and present the results of Monte 
Carlo simulations of the x 2 , Kolmogorov and Anderson- 
Darling (A-D) tests, as well as the results of our power 
studies of the Kolmogorov and A-D tests. In Section 
3, we apply the goodness-of-fit tests to the Canadian 
Network for Observational Cosmology (CNOC2) galaxy 
group data and compare the results. In Section 4 we ap- 
ply the A-D test to the CNOC2 groups and classify the 
dynamical states of the groups. In Section 5, we com- 
pare velocity dispersion profiles (VDPs) of the classified 
Gaussian and non-Gaussian galaxy groups and discuss 
the implications of our results and in §6, we summarize 
our results 

Throughout this paper, we assume a ACDM cosmology 
with n M = 0.3, fi A = 0.7 and H = 75 km s" 1 Mpc" 1 . 

2. STATISTICAL TOOLS 

2.1. Goodness-of-Fits Tests 

The Pearson's x 2 test (Equation [T]) is arguably the 
most commonly used goodness-of-fit test. However, this 
statistic was developed as a large sample theory and its 
reliability begins to break down as one approaches small 
sampl e sizes (n). For small % iD'Agostino fc Stephens! 
(1986) (hereafter DA86) suggest the use of goodness- 
of-fits tests based on empirical distribution functions 
(EDFs), such as the A-D and the Kolmogorov tests. 

a.) Pearson's x 2 Test 
The Pearson's x 2 test is defined as: 

2 (observed^ - expected^) 2 . . 

* ^ expectedi 

The critical points of the \ 2 test were developed for 
large samples and it is difficult to determine exactly 



where, at small n, this theory begins to break down 
(DA86). Although there is no standard method to ap- 
ply the x 2 test, it is often suggested that the bin widths 
be constructed such that they have the same number of 
data p oints in each bin. For small sample sizes Vesscrcau 
(1958) claims that it is not the number of data points per 
bin tha t is important bu t rather the total number of data 
points. IVessere au (1958) also finds that as long as n > 10 
and one uses the 1% or 5% critical values, the x 2 test 
should not produce a significant amo unt of error (with 
respec t to false positives or negatives). iKoehler fc Larntzl 
(1980) claim that for k-1 degrees of freedom, the Pear- 
son's x 2 tes t is 'reasonably a dequa te' when fc > 3 and 
n > 10 and iRoscoe fc Bvarsl (| 1 9 7 If) also find that when 
the degrees of freedom are > 1, the x 2 test does not pro- 
duce significant false positives, remaining robust against 
these type of errors. 

The x 2 test is often performed on binned data, test- 
ing the variance between histograms and a continuous 
Gaussian distribution. Unfortunately, binning can add 
additional errors to the test, a problem which becomes 
worse at small n. One source of error is the choice of bin 
width, which can alter the results of the x 2 test, even 
causing the same data to be classified as both Gaussian 
and non-Gau ssian, depending on the b in width. Several 
authors (e.g. iHealdl 119841 : IScottl I1979T ) have attempted 
to reduce this effect by minimizing the sampling fluc- 
tuations. With testing, Heald's optimal bin width was 
selected, where the bin width is defined as: 



§x = a f^\ 1/5 (2 ) 

where a is the standard deviation. 

The degrees of freedom in the system are given by 
DOF = N — 1 — k, where N is the number of bins 
and k is the number of free parameters in the assumed 
distribution. Thus, for a Gaussian distribution there are 
two free parameters (fc — 2), /i and a, and the minimum 
number of bins is N = 4. 



b.) The Kolmogorov Test 

The popular Kolmogorov test (DA86, and references 
therein) is a goodness-of-fit test based on supremum 
statistics, which measures the vertical difference between 
the EDF, F n (x), of the ordered data Xi and the cumula- 
tive distribution function (CDF), F(x), of a given model 
1 . The EDF statistic computed for the Kolmogorov test 
is the D value, which is derived from the D + and D~ 
values, and is defined as: 



D = max(D + , D~) (3) 

D + = supremuml F(x)\, (4) 

n 

1 It should be noted that the Kolmogorov-Smirnov (K-S) test is 
technically the difference between two EDFs, while the Kolmogorov 
test measures the difference between a model CDF and EDF. De- 
spite this distinction, the Kolmogorov test is often referred to as 
the "K-S test", here we will refer to it by it's proper name. 
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D = supremum|i ;l (a;) 1, (5) 

n 

where F n {x) = ± for D+ F n (x) = ^ for D~, 
and 1 < i < n. I Stephens! (|1974l ) has simplified the 
Kolmogorov test with the modification of the D values, 
called the D* value (Equation 0, which allows for 
comparison with one critical value table, rather than 
computing critical values for sp ecific sample sizes and 
significance levels (|Massevlll95lD . 



D* =D L/^+0.12 + __ \ ( 6 ) 

The use of the Kolmogorov test i n place of the \ 2 test 
for small samples is suggested by iLillieforsI (|1967l). In 
a com parison of the Kolmogorov and x 2 tests, iMassevI 
(1951) concludes that the Kolmogorov test is generally 
more reliable than the \ 2 test, especially for small n, 
where the effects of binning, required by the % 2 test but 
not the Kolmogorov test, can result in a large loss of 
information. 

c.) The Anderson-Darling Test 

Like the Kolmogorov test, the A-D test is also based on 
EDF statistics and does not require binning or graphical 
analysis. Despite these advantages, the A-D test is not 
commonly used in astronomy. The A-D statistic involves 
the calculation of the A 2 and A 2 * values, starting from 
the ordered data {xi}: 



computed mean velocity of the group and a is the 
computed velocity dispersion. The <5>(xi) values are 
then used in the A-D computing formulas (Equations 
|7J) and {8])) to obtain the A 2 * values. These values can 
either be compared to known critical or limiting value 
tables, or used to compute the significance level a, which 
gives the probability of the null hypothesis (i.e., the 
underlying distribution is Gaussian) being true. A more 
detailed discussion of critical values and significance 
levels is given in Section 4. DA86 recommend the 
A-D test as the 'omnibus' test for EDF statistics when 
the underlying distribution is believed to be Gaussian. 
Furthermore, they claim that the A 2 and A 2 * values can 
be reliably computed down to n = 5. 

A method of quantifying the robustness of statistical 
tests involves conducting power studies, which investi- 
gate the percentage of false positives a given test may 
produce when the underlying distribution is distorted 
(i.e., skewed, shifted, wings, etc.). For the Gaussian 
distribution, this involves applying the tests to a variety 
of non-Gaussian samples and determining not only 
how often a specified test will reject the distribution as 
Gaussian, but also the specific types of departures from 
non-normality that affect the rejection rate (DA86, and 
references therein). In a comparison of the Kolmogorov, 
Cramer- von M ises, Kuiper, Watson and A-D tests, 
I Stephens! (| 1974( 1 conducted power studies using a variety 
of non-Gaussian distributions and found the A-D test 
to be most powerful of the EDF statistics for detecting 
departures from Gaussianity, while the Kolmogorov 
test proved to be least powerful. As for the x 2 test, 
DA86 claim that it is in general not a powerful test for 
Gaussian distributions and do not recommend its use. 
We discuss the results of our power studies in Section 2.3. 



A 2 = -n - - V(2i - l)(ln*(a;0 + ln(l - $(x„+i_i))), 
n A — ' 

i—l 

(7) 



^=Wi + ^ + ^a ( 8) 

\ n n z J 

where Xi < x < Xi+i and Q(xi) is the CDF of the hypo- 
thetical underlying distribution. The use of either A 2 or 
A 2 * for the A-D statistics depends on how well the dis- 
tribution parameters are known a priori. The A 2 * values 
are modifications for cases where the distribution param- 
eters are not known a priori and must be estimated from 
the Xi values. If the input parameters are known before- 
hand, then the A 2 values should be used for comparison 
with critical value tables. From Equation JH), it is clear 
that A 2 * approaches A 2 for large n. 

In this analysis, we take $(xj) to be the CDF of a 
Gaussian distribution, given as: 



^=K i+rf (^)) < 9 > 

where Xi is the radial velocity of the galaxy group 
members, arranged from lowest to highest, // is the 



2.2. Monte Carlo Simulations 

We test the claim of lStephensI (| 19 74( 1 and DA86 that 
the A-D is the most reliable test for Gaussianity for small 
sample sizes, by performing Monte Carlo simulations of 
the x 2 > Kolmogorov, and A-D tests. We perform 30,000 
iterations for a variety of sample sizes, with 5 < n < 50 
and drawing from a random Gaussian distribution with 
input values of fi = Q and a = 1.0, to determine the 
reliability of the tests and how accurately each test can 
reproduce published critical values. 

The results of the x 2 Monte Carlo simulations are 
shown in Figure [IJ where we have plotted histograms 
of x 2 /DOF for our various sample sizes. Ideally a peak 
should occur at x 2 /DOF = 1.0 for the x 2 test, but we 
see that there is significant scatter in the histograms for 
n < 30 (Figure [T]). Most notable is the n — 5 histogram 
in Figure[TJ which has two peaks at x 2 /DOF > 10, much 
higher than the a = 0.10 critical value (i.e., the x 2 value 
above which 10% of values fall) of 2.41 (DOF = 1). Thus 
for small n, the x 2 test tends to overestimate the num- 
ber of failed/non-Gaussian samples. It is only for n > 30 
that we see the expected peak of 1.0. Unfortunately, the 
majority of the CNOC2 groups have n mem bers < 30, so 
use of the x 2 test to classify galaxy group dynamics could 
result in the false identification of many groups with non- 
Gaussian velocity distributions. 

The results of the Kolmogorov Monte Carlo simula- 
tions are shown in Figure [2 where we show histograms 
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Fig. 1. — Monte Carlo simulations of the x 2 test for different 
sample sizes, using a Gaussian random number generator with in- 
put values of fi = 0.0 and <T int = 1.0, and 30 000 iterations. The 
histograms are generated with a bin width of 0.5. For n < 30, the 
X 2 test completely fails to recover the input distribution. This is 
most obvious for the n = 5 case, where the histogram shows two 
peaks at x 2 /DoF = 10, 12, instead of the expected peak value of 
X 2 /DoF = 1.0. Only for n > 30 do the simulations recover the 
expected peak value. 

of the computed D* values and indicate the a = 0.10 
critical value of 1.224 (DA86) with a dotted vertical line. 
Unlike the histograms for the x 2 test, we see no scatter 
in the D* values, even at n = 5. The histograms of the 
D* values remain remarkably stable over the sample size 
range, indicating that the Kolmogorov test is reliable for 
small n. To determine if our simulations produce the ex- 
pected D* critical values, we compute these values from 
our histograms (Figure [2]) and compare them to pub- 
lished values cited in DA86. For each sample size, we are 
able to reproduce all of the given critical values for the 
case where the input distribution parameters, y, and ex, 
are known. 

In Section 2.1, we discussed the use of the A 2 * values 
for the A-D test, but this is a modification for the case 
where the distribution parameters are unknown. When 
the input parameters are known a priori, as is the case 
with our Monte Carlo simulations, DA86 state that no 
modification for the A-D test is needed and one should 
use the A 2 values when comparing to critical value ta- 
bles. The results of the A-D test simulations are shown 
in Figure [3l where we plot histograms of the computed 
A 2 values and indicate the a = 0.10 critical value of 1.933 
(DA86) with a dotted vertical line. The histograms are 
similar to those of the Kolmogorov test, showing no scat- 
ter in the A 2 values over the entire sample size range. 
The stability of the A-D test, even at n = 5, supports 
the claim of DA86 that the statistic is reliable for all 
n > 5. We compute the a — 0.10 critical values for the 
A-D test from the histograms in Figure [3] and find that 
our values are in agreement with those found in DA86. 
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Fig. 2. — Monte Carlo simulations of Kolmogorov test for differ- 
ent sample sizes, using a Gaussian random number generator with 
11 = 0.0 and <7i n t = 1.0, and 30,000 iterations. The histograms of 
the D* values are plotted using a bin width of 0.05. The results of 
the simulations show that even at n = 5, the Kolmogorov test is 
able to recover the input distribution. The histograms for n = 5 
to 50, consistently reproduce the expected peak values, indicating 
that the test is reliable over a wide sample range. The vertical dot- 
ted line represents the D* value above which 10% of the values lie, 
our computed values are in complete agreement with the known a 
= 0.10 critical value of 1.224 (DA86). 




A 2 

Fig. 3. — Monte Carlo simulations of A-D Test for different sam- 
ple sizes, using a Gaussian random number generator with input 
values of fi = 0.0 and <r; nt = 1.0, and 30,000 iterations. The his- 
tograms of the A 2 values are plotted using a bin width of 0.2. The 
results of the simulations show that even at n = 5, the A— D test 
is able to recover the input distribution. The histograms for n = 5 
to 50, consistently reproduce the expected peak values, indicating 
that the test is reliable over a wide sample range. The vertical dot- 
ted line represents the A 2 value above which 10% of the values lie, 
our computed values are in complete agreement with the known a 
= 0.10 critical value of 1.933 (DA86). 
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2.3. Power Comparisons of the Kolmogorov and A-D 

Tests 

Our Monte Carlo simulations indicate that the % 2 test 
is indeed an unreliable statistic for testing Gaussianity 
in small sample sizes, as suggested by DA86. Thus, 
we can eliminate the use of this test for classification 
of the galaxy group dynamics. However, the results 
of our simulation also show that both the Kolmogorov 
and A-D statistics are reliable down to n — 5. In or- 
der to determine which test is most suitable for distin- 
guishing between Gaussian and non-Gaussian systems, 
we perform power studies of the Kolmogorov and A-D 
tests. Monte Carlo simulations of both tests are per- 
formed, using a skewed G aussian distribution, F(xi) 
(jAzzalini fc Capitaniolll999| ). given by: 

F(xi) = 2<f>{x i )<P{a s x l ) (10) 



where 4>{x) is the probability density function of a Gaus- 
sian distribution, with /i = 0.0 and a — 1.0, <&(xi) is the 
CDF of a Gaussian distribution (Equation ([9])) and as 
is known as the shape or slant parameter. The value of 
as is proportional to the skewness of the Gaussian distri- 
bution, with higher values of as producing more skewed 
distributions and an as = 0.0 producing a non-skewed 
Gaussian. 

We draw various sample sizes, 5 < n < 100, from a 
Gaussian random distribution, using 30,000 iterations, 
and apply varying levels of skewness, < as < 5, to de- 
termine the rejection rate of both the Kolmogorov and 
A-D tests. The results of our power studies are given 
in Table [TJ where Column 1 indicates the test used, Col- 
umn 2 indicates the as value, and Columns 3-9 indicate 
the percentage of rejection given a specific sample size. 
The rejection rates are determined using the 10% critical 
values given in DA86. In order for a test to be consid- 
ered powerful, the simulations with high as should have 
higher rejection rates, since the underlying distributions 
are increasingly less Gaussian. From Table [TJ it is clear 
that for all levels of skewness and all sample sizes, the 
A-D statistic rejects more objects than the Kolmogorov 
test. For the as = 0.25 and n < 30 cases the two tests 
are comparable, but as one increases as, the percent- 
age of rejections for the Kolmogorov test is significantly 
lower than those of the A-D test. Looking at the rejec- 
tion rates for the as = 1.0, which is a heavily skewed 
Gaussian, it is clear that the Kolmogorov test underes- 
timates the number of non-Gaussian systems. This is 
especially evident when one focuses on the n = 30 case 
for the as = 1.0 distribution, which has a 100% rejection 
rate for the A-D test, but only an 85% failure rate for 
the Kolmogorov. 

The results of our power studies indicate that the Kol- 
mogorov test is much less powerful at detecting depar- 
tures from Gaussianity than the A-D statistic, w hich is 
in agreement with the findings of I Stephens) (|1974f ). The 
strongest evidence for this claim is shown in the as = 5.0, 
a completely non-Gaussian distribution, and n = 5 sim- 
ulation, where the A-D test rejects 100% of the sample 
while the Kolmogorov test only rejects 75% of the sam- 
ple. 



TABLE 1 

Results of the Power comparisons of the Kolmogorov and 
A-D tests using various skewed Gaussian distributions 3 -. 



Test 


as 


5 


10 


15 


20 


n 
25 


30 


40 


50 


100 


A~ 


0.25 


15 


17 


19 


21 


24 


27 


32 


.37 


64 


D* 


0.25 


13 


15 


16 


18 


20 


22 


26 


29 


16 


A 


0.50 


27 


38 


48 


57 


66 


74 


86 


93 


100 


D* 


0.50 


20 


27 


34 


41 


47 


53 


6.3 


72 


95 


A 2 


0.75 


43 


62 


77 


87 


93 


96 


99 


100 


100 


D* 


0.75 


28 


42 


55 


66 


74 


81 


90 


96 


100 


A 2 


1.0 


59 


81 


92 


97 


99 


100 


100 


100 


100 


D* 


1.0 


35 


57 


72 


8.3 


91 


85 


99 


100 


100 


A 2 


1.5 


80 


96 


99 


100 


100 


100 


100 


100 


100 


D* 


1.5 


50 


79 


91 


97 


99 


100 


100 


100 


100 


A 2 


2.0 


90 


99 


100 


100 


100 


100 


100 


100 


100 


D* 


2.0 


67 


86 


97 


99 


100 


100 


100 


100 


100 


A 2 


5.0 


100 


100 


100 


100 


100 


100 


100 


100 


100 


D* 


5.0 


75 


96 


100 


100 


100 


100 


100 


100 


100 



3. APPLYING THE TESTS TO DATA 
3.1. The Sample 

We now apply the x 2 , Kolmogorov, and A-D tests to 
a sample of galaxy groups identified in the second Cana- 
dian Network for Observational Cosmology Redshift 
Survey (CNOC2) in t he redshift range of0.1<z<0.6 
(|Carlberg et alJ l2001h . The CNOC2 survey observed 
~ 4 x 10 4 galaxies covering four patches, 1.5 deg 2 
in area, in the UBVRcIc bands down to a limiting 
magnitude of Rc = 23.0. Spectra of more than 6000 
galaxies were obtained with the MOS spectrograph on 
the Canada-France-Hawaii Tel escope (CFHT ) , with 
48% completeness at R c = 21.5 (|Yee et al.ll2000l) . 

Over 200 galaxy groups were identified using a friends- 
of-friends percolation a lgorithm in th e CNO C2 survey 
(|Carlberg et al.ll2001h . IWilman et all (|2005h (hereafter 
W05) obtained deeper spectroscopy taken with the low 
dispersion survey spectrograph (LDSS2) on the Mag- 
ellan telescope and then redefined group membership 
with mo re relaxed algo r ithm parameters than those 
used by I Car lb erg et alJ (|2001h . The original search 
parameters were optimized so that the group-finding 
algorithm would identify dense, virialized groups, while 
the W05 sample included looser group populations. 
We would like to quantify how many of these groups 
have more complex velocity distributions, potentially 
identifying merger products or systems in the early 
stages of virialization. 

3.2. Estimation of Distribution Parameters /! and a 

The x 2 ; Kolmogorov and A-D statistics were devel- 
oped under the assumption that all of the parameters 
of the underlying distribution were completely specified. 
Modifications to the statistics, with the use of Monte 
Carlo simulations, have been carried out to allow these 
tests to be applied to cases where the distribution pa- 
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rameters are not completely known a priori, but must 
be estimated from the data (DA86). 

The parameters required to define a Gaussian distri- 
bution are the mean, ji, and the dispersion, a. In the 
analysis described in Section 3.3, [i is calculated using 
the standard mean, and the velocity dispersions are es- 
timated with the Gapper algorithm, given by: 



(^Gapper 



n{n — 1) 



E 



Wig i 



(12) 



where Wi = i{n — i), gi = Xi + \ — Xi, here the ordered Xi 
values are given by the observed radial velo cities of the 
roup members. For small number statistics. iBeers et al.l 
1990) recommend the Gapper Estimator over the canon- 
ical rms standard deviation, as this algorithm is insensi- 
tive to outliers and thus more accurately reproduces the 
true dispersion of the system. 

To ensure that the Gapper Estimator is a more reli- 
able method of computing the dispersion of a system, 
we perform Monte Carlo simulations of both the Gap- 
per algorithm and the canonical rms standard deviation. 
We draw various sample sizes (n — 5, 15, 20 50) f rom a 
Gaussian random number generator ( Galassi 2006) with 
the inputs fi = 0.0 and ^intrinsic = 100 and then compute 
the dispersion using the two aforementioned methods. 

The results are shown in Figure [4J where we have plot- 
ted histograms of the output dispersions for each sample 
size. Figure 0] shows that for the n — 5 case the canoni- 
cal rms standard deviation underestimates the true dis- 
persion by 25% and the distribution is heavily skewed 
to lower values. Although the distribution of the Gapper 
Estimator is also skewed, the peak of the distribution oc- 
curs at the true dispersion value of 100, indicating that 
this method is indeed insensitive to outliers. As we in- 
crease the sample size, n = 15 and 20, we can see that 
the rms dispersion continues to underestimate the true 
velocity dispersion, but also that the two methods begin 
to converge. The histogram for the n = 50 case shows 
that the rms dispersion and the Gapper Estimator both 
correctly identify the true dispersion. Given the small n 
of the CNOC2 groups we choose the Gapper algorithm 
to estimate the velocity dispersions of the groups. 



3.3. Comparison of Tests 
3.3.1. Statistical Analysis 

The x 2 : Kolmogorov and A-D tests are applied to a 
subset of 62 CNOC2 groups containing at least 10 mem- 
bers per group. We chose this membership cut based 
on the result of our n = 5 x 2 Monte Carlo simulations, 
as shown in Figure [TJ which indicated that the test was 
completely unreliable at small n. This cut also helps to 
minimize the effect of binning. The potential compli- 
cations introduced by the choice of bin width are also 
reduced by using the optimal bin width formula (Equa- 
tion J2J). 

Despite our attempts to minimize the various uncer- 
tainties due to binning, we still encounter groups that 
do not meet the minimum number of bins requirement, 
N = 4, for our application of the x 2 test, with five out 
of the 62 groups in the sub-sample having N < 4. Thus, 
we apply the \ 2 test to 57 groups (n > 10 and N > 4). 
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Fig. 4. — Monte Carlo simulations of the Gapper Estimator 
and the rms dispersion methods for n = 5 (top left), n = 15 (top 
right), n = 20 (bottom left) and n = 50 (bottom right). For each 
histogram, we compute the Gapper and rms dispersions using ve- 
locities generated from a random Gaussian distribution with input 
values of fi = 0.0 and <7i nt = 100. This process was done with 
30,000 iterations. The dotted lines indicate dispersion values com- 
puted using the canonical rms standard deviation and the solid 
lines are values calculated using the Gapper algorithm. For n = 5, 
the histogram for the rms method is skewed and underestimates 
the true dispersion by 20%. For the same sample size, the peak of 
the Gapper Estimator histogram is located at the true dispersion 
value of 100, even though the histogram is also skewed, indicating 
that the Gapper Estimator is insensitive to outliers. For the n = 50 
case, the two methods begin to converge, with both the rms and 
Gapper algorithms picking out the true dispersion value. 

The modification to the D-statistic (Equation ©) 
given in Section 2.1 is used for the case when the input 
distribution parameters are known a priori (i.e., Monte 
Carlo simulations in Section 2.2). When applying the 
Kolmogorov test to real data sets, one must estimate the 
distribution parameters, \x and cr, and thus the modifi- 
cation for the D value becomes (DA86): 



D* =D\ vT") - o.oi + 



0.85 

7b) 



(13) 



where D is given by Equation (|3j). 

Similarly, in Section 2.2 we used the A 2 values, but for 
real data sets one must use the A 2 * values (Equation (JSJ) 
in order to properly apply the A-D statistic. Since, the 
A-D and Kolmogorov tests do not require binned data, 
we are able to apply both tests to all 62 groups (n > 10). 

The results of our analysis are presented in Table [2 
where Column 2 indicates the number of groups used 
to perform the specified test and Column 3 indicates the 
number of groups that failed at the 0.10 significance level. 
Our initial findings show that these three tests differed 
in the number of rejected (non-Gaussian) groups, with a 
21% rejection rate for the x 2 test, a ~ 11% rejection rate 
for the Kolmogorov test and a ~ 16% rejection rate for 
the A-D test. 

The U-statistic of the Kolmogor ov test is often used 
to test for goodness-of-fit, but both lStephensI (|1974h and 
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DA86 do not recommend its use for testing Gaussian 
distributions, based on its lack of power. The results 
of our own power studies, in §2.3, also indicate that 
the Kolmogorov test lacks power and is unable to de- 
tect real departures from Gaussianity. The relatively low 
non-Gaussian detection rate in the CNOC2 group sub- 
sample, with only seven out of the 62 CNOC2 groups 
failing at the 0.10 significance level, further supports 
the notion that the Kolmogorov test suffers from under- 
rejection, suggesting that the Kolmogorov test may not 
be reliable method for dynamical classification. 



TABLE 2 

Results of the x 2 , Kolmogorov and A-D tests applied to a 

SAMPLE OF CNOC2 GROUPS WITH n > 10. 



passed x 2 
passed A-D 
-passed K-S 



Group 110 



failed X s 
failed A-i 
failed K-l 



Group 208 



3 
Z 



Test 



Pearson's x 2 
Kolmogorov 
A-D 



Number 
of Groups 



Number of 
Failed Groups 



Percent of 
Failed Groups 



Significance 
Level 



failed x 2 
passed A- 
passed K- 



1 



passed x 3 
failed A-D 
passed K-S 



Group 239 



57 
62 
62 



12 
7 
10 



21 
11 

16 



0.10 
0.10 
0.10 a 



3.3.2. Velocity Distributions 

The reliability of the A-D test over the \ 2 an d Kol- 
mogorov tests is further demonstrated when one looks 
at the velocity distributions of specific groups. In Figure 
03 we show the velocity distributions of four CNOC2 
groups that have either failed the \ 2 , Kolmogorov, 
and A-D tests (non-Gaussian groups), passed all three 
tests (Gaussian groups), failed the x 2 test but passed 
the Kolmogorov and A-D tests, or have passed the % 2 
and Kolmogorov tests but failed the A-D test. The 
histograms are made using Heald's optimal bin width 
(Equation ([2])) and are over-plotted with a Gaussian 
distribution generated using the estimated mean and 
dispersion of the group. Although the Kolmogorov and 
A-D tests do not use binned data, we can look at the 
velocity distributions of groups classified as Gaussian 
or non-Gaussian by the various tests to see if there are 
obvious visual departures from normality. 

Group 110 is classified as Gaussian by the x 2 , Kol- 
mogorov, and A-D tests and from Figure we can see 
that the shape and mean of the velocity distribution 
agrees well with the underlying Gaussian distribution. 
Group 208 (Figure O is a group that has been classified 
as non-Gaussian by all three tests, and it is evident 
that the velocity distribution is non-Gaussian, as the 
histogram shows a double peak. 

While Groups 110 and 208 are examples of systems 
with obvious Gaussian or non-Gaussian features in their 
velocity distributions, this distinction is not so clear for 
several of the CNOC2 groups. Group 366 represents 
groups that have failed the \ 2 test, but have passed the 
Kolmogorov and A-D tests. The histogram for Group 
366 (Figure E|) , shows no obvious departures from the 
Gaussian distribution and despite the use of Heald's 
optimal bin width, the relatively low group membership 
(n = 15) results in binning issues and causes the group 
to be rejected by the % 2 test. The Kolmogorov and A-D 
tests use ordered rather than binned data, and thus do 



Radial Velocity (km s -1 ) 

Fig. 5. — Velocity distribution histograms for various CNOC2 
grou ps. The dotted lines show Gaussian distributions computed 
using the estimated /i and a of the group. The histograms are 
generated using Heald's optimal bin width (Equation J2)), thus the 
bin widths differ between groups. Group 110 represents groups that 
have passed the x 2 , Kolmogorov, and A-D tests (at the 10% level) 
and are classified as Gaussian. Group 208 represents groups have 
failed all three tests and are thus classified as non-Gaussian. The 
histogram for Group 208 appears non-Gaussian, showing a double 
peak. Group 366 represents groups that have failed the x 2 t es t but 
passed the Kolmogorov, and A-D tests. The histogram for Group 
366 shows no obvious departures from the Gaussian distribution, 
but suffers from binning issues, resulting in rejection by the x 2 
test. Group 239 represents groups that have passed the x 2 an d 
Kolmogorov test, but failed the A— D test. The histogram for this 
group appears skewed and has a mean that is slightly shifted from 
that of the Gaussian distribution. Despite these features, only 
the A-D test is able to detect these departures and rejects Group 
239 as having an underlying Gaussian distribution. Note that the 
Kolmogorov and A-D tests do not require binned data and are 
computed using ordered data. Also, note that the y-axes differ 
between plots. 

not suffer from the issues introduced when binning small 
sample sizes. 

Group 239 represents groups that have passed the 
X 2 and Kolmogorov tests, but not the A-D test. This 
histogram for this group (Figure [5]) appears skewed and 
and has a mean that is slightly shifted from that of 
the Gaussian distribution. Despite these non-Gaussian 
features, only the A-D test is able to detect these de- 
partures and rejects Group 239 as having an underlying 
Gaussian distribution. This group highlights the fact 
that the Kolmogorov test lacks power, as discussed in 
Section 2.3, and is unable to detect slight departures 
from Gaussianity. 



4. APPLICATION OF THE ANDERSON-DARLING TEST TO 
THE CNOC2 GROUPS 

Having identified the A-D test as the best statisti- 
cal tool for galaxy group dynamics analysis, we can 
now proceed to classify the CNOC2 groups as being ei- 
ther relaxed (Gaussian) or dynamically complex (non- 
Gaussian) systems. In our analysis in Section 3, we ap- 
ply only a minimum membership cut, but proper classi- 
fication requires a more detailed treatment of the group 
catalog. Group members were reallocated by W05, in- 
cluding some galaxies at large group-centric distances to 
study radial trends. We apply a 1 Mpc radius cut to each 
of these re-identified CNOC2 groups. Figure [6] shows a 
histogram of the number of group members, after the 
radius cut, with the majority of groups having 5-11 
members. 
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Fig. 6. — Histogram of the number of members per CNOC2 
galaxy group in our sample, after a 1 Mpc radius cut and consid- 
ering only groups with n > 5. 

For the distribution parameters, we estimate /i as the 
mean group member velocity and a by computing the in- 
trinsic velocity dispersion (W05), obtained by first com- 
puting the observed dispersion, given by: 

(Jobs — 1.135 * (TQapper, (14) 

where the factor of 1.135 is a correction applied by W05 
for their redshift space cut of 2^^-. The next step is to 
compute the rest-frame velocity dispersion: 



where z is the redshift of the group centroid. The final 
step involves removing the measurement uncertainty of 
each galaxy, (6(v)), from the rest-frame dispersion, as: 

<4t = "rest ~ (S(v)f , (16) 



where (6(v)) = 142 km s" 1 for the (LDSS) data and 
(5{v)} = 103 km s- 1 for the original CNOC2 data (W05). 

We compute the intrinsic velocity dispersion for all 
W05 re-identified CNOC2 groups with n me mbers > 5 
within 1 Mpc, then classify the groups based on the com- 
puted A 2 * values. A crucial step in any statistical anal- 
ysis is to identify the appropriate significance level or 
critical value used for classification. The critical points 
of the A-D test, as with all EDF statistics, change de- 
pending on the accuracy with which one knows the input 
distribution parameters (DA86). For a Gaussian distri- 
bution, there are four cases to be considered: 

• case 0: both fi and a 2 are known a priori, i.e., a 
fully specified distribution; 

• case 1: a 2 is known and \x must be estimated; 

• case 2: /i is known and a 2 must be estimated, and; 

• case 3: both /i and a 2 must be estimated . 

Each of these cases result in different critical values, 
which can greatly alter the number of rejections. For ex- 
ample, the 5% critical value is 2.492 for case 0, 1.087 for 
case 1, 2.308 for case 2 and 0.752 for case 3 (DA86) 
When testing for Gaussianity in data sets, IStephensI 
(|1974h suggests that case 3 is the most practical choice, 
as distribution parameters are in general estimated and 
not known a priori. In this situation, there are two ap- 
proaches one can take in distinguishing between Gaus- 
sian and non-Gaussian systems; the first involves com- 
paring the computed A 2 * values with critical value tables 
and the other uses the A 2 * values to directly compute the 
significance level a, which gives the probability that the 
data comes from an underlying Gaussian distribution. 
We choose to follow the latter method and compute a 
using the formula: 

a = acxp(-A 2 * /b), (17) 

where a = 3.6789468 and b = 0.1749916, and b oth fac- 
tors are determined via Monte Carlo methods (|Nelsonl 
fl998h . 

Using Equation (fl"7|) . we are able to determine the 
probability of whether or not each CNOC2 group had 
a Gaussian velocity distribution, classifying all groups 
with a < 5% as being non-Gaussian. The results of the 
A-D test are given in Tabled where we see that ~32% 
of the CNOC2 groups are classified as non-Gaussian at 
the 5% significance level and also that the n, z, and a 
are similar for the Gaussian and non-Gaussian groups. 
Using this classification scheme, we can now investigate 
specific group properties to determine if there are any 
obvious trends or differences between the Gaussian and 
non-Gaussian groups. The properties of the CNOC2 
groups that are classified as dynamically complex (non- 
Gaussian) are given in Table |4j 

The intrinsic velocity dispersions of the non-Gaussian 
groups, shown in Figure [7] show no obvious trend, with 
the values ranging from ~40 to 850 km s _1 , so the more 
dynamically complex systems are not restricted to low 
or high velocity dispersions. Similarly, the intrinsic ve- 
locity dispersions of the Gaussian groups (Figure [7]) also 
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TABLE 3 

Anderson- Darling Classification of the CNOC2 groups 

WITH n > 5 AFTER THE 1 MPC CUT. 



Classification 


# of groups 


% of all 


groups 


n 


z 


0int 
















(km s ) 


Gaussian 




72 


~ 6 


i% 


9 


0.30 




non-Gaussian 


34 


~32% 


9 


0.37 


oZ 1 






TABLE 4 










Properties of non-Gaussian CNOC2 Groups. 




group ID 


n a 


Cobs 


&rest 


Vint 




A' z * 


a 






(km s" 1 ) 


(km s" 1 ) 


(km s~ 


*) 






24 


10 


151 


111 


42 




13.4 


2.69o-33 


29 


9 


465 


338 


322 




0.780 


0.0426 


30 


11 


428 


307 


289 




1.18 


0.00424 


32 


8 


755 


542 


532 




0.945 


0.0166 


33 


6 


235 


167 


126 




1.61 


0.000375 


34 


6 


254 


173 


134 




1.67 


0.000271 


38 


16 


1208 


800 


793 




1.66 


0.00940 


120 


6 


248 


200 


171 




0.752 


0.0499 


127 


6 


180 


137 


91 




1.18 


0.00427 


128 


5 


586 


445 


433 




1.22 


0.00342 


129 


5 


371 


282 


262 




0.813 


0.0353 


132 


8 


542 


399 


382 




1.31 


0.00202 


135 


7 


379 


271 


251 




1.03 


0.0105 


138 


23 


1064 


740 


731 




0.973 


0.0142 


139 


10 


363 


252 


226 




1.39 


0.00128 


140 


5 


219 


149 


100 




1.18 


0.00432 


202 


5 


423 


356 


341 




0.836 


0.0309 


211 


8 


234 


184 


153 




0.892 


0.0225 


213 


7 


581 


446 


434 




0.763 


0.0469 


218 


5 


443 


338 


322 




0.880 


0.241 


221 


6 


171 


126 


73.1 




2.37 


4.863-06 


226 


25 


1159 


853 


847 




1.04 


0.00940 


230 


10 


212 


153 


113 




1.25 


0.00298 


233 


7 


795 


568 


559 




1.57 


0.000463 


241 


6 


316 


222 


197 




0.757 


0.0487 


244 


15 


356 


242 


211 




1.35 


0.00168 


312 


8 


424 


344 


328 




1.14 


0.00545 


336 


5 


965 


708 


700 




0.947 


0.0164 


338 


9 


503 


367 


352 




0.822 


0.0336 


346 


26 


613 


446 


434 




1.26 


0.00277 


351 


6 


243 


176 


143 




0.814 


0.0352 


357 


7 


234 


168 


133 




1.80 


0.000128 


358 


11 


409 


294 


275 




0.776 


0.0436 
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878 
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Fig. 7. — Histogram of the computed intrinsic velocity disper- 
sions ((Ti n t) for CNOC2 galaxy group, with n > 5 after a 1 Mpc 
radius cut, which have been classified as either Gaussian (solid line) 
or non-Gaussian(dashed line) by the A-D test. 
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cover a wide range in values, ~75 - 725 km s _1 . Fig- 
ure [8] shows a histogram of the number of members in 
the CNOC2 groups, with the solid line representing the 
classified Gaussian groups and the dashed line indicat- 
ing the non-Gaussian groups. Here we also see a wide 
range in the number of members for the Gaussian and 
non-Gaussian groups, suggesting that the A-D test is not 
biased towards a specific sample size. 

5. VELOCITY DISPERSION PROFILES 

One way to investigate the differences between relaxed 
and dynamically complex galaxy groups is to study their 
VDPs. If a group classified as non-Gaussian is in fact dy- 
namically more complex than a Gaussian one, then the 
corresponding VDP may exhibit different features from 
those of relaxed systems. In the i r ana lysis of merging 
clusters, iMenci fc Fusco-Femianol (|1996h find that radi- 
ally increasing VDPs indicate significant galaxy merging 



^members 

Fig. 8. — Histogram of the number of members per CNOC2 
galaxy group, with n > 5 after a 1 Mpc radius cut, which have been 
classified as either Gaussian (solid line) or non-Gaussian (dashed 
line) by the A-D test. 



in the cluster core. iGirardi et al.1 (|1996( ) also suggest that 
the presence of neighboring clusters resulted in a VDP 
with an initially flat profile, which then increased signif- 
icantly at larger radii. 

We follow the method outlined in lBergond et al. (2006) 
to generate VDPs for the CNOC2 groups in which the ra- 
dial velocities are binned with an exponentially weighted 
moving window. The window function is given by: 



Wi (R) = — exp 



(R - Rif 



24 



(18) 



10 



where an is the width of the window, which can be con- 
stant or a function of radius R, and the R^s are the radial 
positions of the members of the system. The projected 
velocity dispersions are then defined as: 



(Tp(R) 



/ £« Wi(R)(xi-x)* 



(19) 



where the Xi S 0X6 the radial velocities and x is the mean 
velocity of the system. 

This "moving window" prescription for computing 
VDPs takes into account the contribution of every ra- 
dial velocity measurement at each value of R. It also 
removes the restriction of computing binned projected 
velocity dispersion. Instead, a smoothed profile can be 
generated, since the projected dispersions can be com- 
puted at any radius, not just at the radii corresponding 
to the observed velocities. 

In order to use this method to probe the dynamics of 
a system, one must be careful with the choice of win- 
dow width, an, as a window that is too large can wash 
out real features and, a window that is too small tends 
to add spurious features in the profile. VDPs of sys- 
tems with very small n may also contain unphysical fea- 
tures, similar to profiles made with small window widths. 
Since the projected velocity dispersions are computed us- 
ing weighted values of every velocity measurement in the 
data, any large individual deviations from the mean can 
alter the overall shape of the profile. This effect is sig- 
nificantly more pronounced in small samples, as there 
are not enough data points to counteract or outweigh 
the effects of an outlier. With testing, a window width 
of 0.35 Mpc, approximately one-third the value of the 
maximum radius is selected. We also enforce a minimum 
group membership of 20 members, after our 1 Mpc ra- 
dius cut, to ensure that any visible trend in the VDPs is 
not a result of outliers. 

There are five CNOC2 groups that meet our mini- 
mum group membership criteria (n > 20), two of which 
are classified by the A-D test as Gaussian, Groups 110 
(n = 26) and 308 (n = 25) and three as non-Gaussian 
systems, Groups 138 (n = 23), 226 (n = 25), and 346 
(n = 26). The VDPs for these groups are shown in 
Figure El where the Gaussian groups are shown with 
filled symbols and the non-Gaussian groups with open 
symbols. Both Gaussian groups have decreasing profiles, 
while two of the three non-Gaussian groups (138 and 346) 
have increasing profiles. The VDP for Group 226 (Figure 
[9]) does not exhibit the same overall trend as the other 
non-Gaussian groups, as the profile increases initially but 
turns over at roughly 0.4 Mpc. Although the profile for 
Group 226 does not continually increase, it does show dis- 
tinct features from the profiles of the Gaussian groups. 
It is impossible to make general statements on the overall 
shape of all Gaussian or non-Gaussian groups based on 
these five groups alone, but the results do support our 
claim that galaxy groups classified as non-Gaussian by 
the A-D test are dynamically complex systems. We are 
unable to determine by the VDPs alone whether these 
non-Gaussian groups are indeed undergoing a merger, 
but the differences between Gaussian and non-Gaussian 
group profiles do suggest that these two types of systems 
are dynamically distinct. 



A closer inspection of the individual profiles of the 
Gaussian groups suggests another interesting result. The 
profile for Group 110 shows an initial decrease with an 
eventual flattening of the profile towards the outer ra- 
dius, a gener a l tren d that was observed in clusters by 
iGirardi et all (|1996h . The VDP of Group 308 does not 
exhibit the same trend of a flattened profile, continually 
decreasing towards the edge of the group. 
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Fig. 9.— VDPs for the CNOC2 groups with n > 20 after a IMpc 
radius cut and using a constant window width of 0.35 Mpc. The 
open symbols indicate groups classified as non-Gaussian and the 
closed symbols indicate those classified as Gaussian. 



6. SUMMARY 

Analysis of galaxy group dynamics requires the use 
of tools that are reliable and powerful for small sam- 
ple sizes. We have applied three goodness-of-fit tests, 
the x 2 : Kolmogorov, and A-D tests, to a subset of the 
CNOC2 groups in order to determine which test can best 
classify galaxy group dynamics. Based on our initial ap- 
plication of the aforementioned tests and on the results 
of our Monte Carlo simulations and power studies, we 
conclude that the A-D test is the most reliable statistic 
to distinguish between relaxed (Gaussian) and dynami- 
cally complex (non-Gaussian) groups. 

The results of our Monte Carlo Simulations for the 
Gapper Estimator (Equation (|12[) ) and rms dispersion 
calculations indicate that for small sample size, n < 50, 
the Gapper algorithm is a more accurate estimate of 
the true velocity d ispersion, which is in agreement with 
iBeers et al.1 (|1990f ). 

We then apply the A-D test to all CNOC2 groups with 
^members > 5, after a 1.0 Mpc radius cut, using the mean 
velocity of the group members as the estimated \x and 
the intrinsic velocity dispersion (Equation (|16|1 ) as the 
estimated a. The groups are then classified as being in 
either a relaxed (Gaussian) or complex (non-Gaussian) 
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dynamical system, based on their computed significance 
levels. The results of our analysis indicate that 34 of the 
106, or ~ 32%, of the sample of CNOC2 groups were 
classified as non-Gaussian. 

To investigate our claim that classified non-Gaussian 
groups are indeed dynamically more complex than Gaus- 
sian ones, we look at the VDPs of five CNOC2 groups 
with n mem bers > 20. Analysis of the resulting profiles 
indicates that; 

1. the profiles of the two Gaussian groups (110 and 
308) show a declining velocity dispersion with ra- 
dius; 

2. two non-Gaussian groups (138 and 346) have 
rising profiles, a possible sig nature of mergers 
(M end &; Fusco-Femianolll996l h and; 

3. the profile of Group 110 flattens towards lar ger 
radii, a trend observed by iGirardi et al.l (|1996f ) in 
galaxy clusters. 

The VDPs of the Gaussian and non-Gaussian groups 
are distinct, supporting our claim that the classified 
non-Gaussian groups are dynamically different from the 
Gaussian systems. 

We intend to investigate other observed group proper- 
ties, such as morphology, colour and star formation rates, 
for correlations between the dynamical state of a group 



and its properties. We also plan to study the dynamics of 
X-ray selected groups (J . Connelly, et al, in preparation; 
iFinoeuenov etaD (pOO^ I in the CNOC2 fields. X-ray 
bright groups are of particular interest, as only groups 
in a relaxed dynamical state are expected produce ex- 
tended X-ray emissions. The application of the A-D test 
to these groups will allow us to test this hypothesis. Fu- 
ture work will also include a detailed study of simulated 
galaxy groups in order to understand the relationship 
between velocity dispersion and mass for systems in dif- 
ferent dynamical states. Simulations will also allow us to 
quantify how projection and contamination by interlop- 
ing galaxies affect our measured velocity distributions. 

We conclude that the A-D goodness-of-fit test is a re- 
liable statistical tool. Although we have used this test to 
determine departures from normality, its application is 
not restricted to Gaussian distributions and can be used 
with many continuous or discrete distributions. Not only 
is this test reliable and powerful, especially when deal- 
ing with small sample sizes, but its application is simple 
and has the potential to be useful is many other areas of 
astronomy. 
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